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10 BACKGROUND OF THE INVENTION 

Field of the Invention 

This invention relates generally to methods used 
in implementation of the network Transmission Control 
15 Protocol, and more specifically to congestion control 
methods used in the Transmission Control Protocol . 

Description of Related Art 

The network Transmission Control Protocol (TCP) is 

2 0 well known. TCP runs only in end systems 100, 110 on a 
network 12 0 and not in intermediate network elements 
(not shown) such as routers or bridges. The 
intermediate network elements do not maintain TCP 
connection state. A TCP connection between end 

25 systems 100, 1.10, sometimes called hosts 100, 110, 
provides full duplex data transfer between 
hosts 100, 110. A TCP connection is always point-to- 
point, i.e., between a single TCP sender and a single 
TCP receiver. 

30 When a TCP connection is established, two 

application processes 101, 111 can send data to each 
other. Data is transmitted in segments. A segment 
includes header fields and a data field. The data 
field contains application data. The amount of 

35 application data that can be placed in the segment data 
field by the TCP sender is a maximum segment size. The 
TCP sender monitors flow control by monitoring an 
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advertised receive window rwnd for the TCP receiver, 
which is the size of the TCP receive buffer. 

Two header fields of interest are the sequence 
number field and the acknowledgement number field. As 
5 is known, in TCP, data is viewed as an unstructured, 
but ordered, stream of bytes. The sequence numbers 
used in TCP are over the stream of transmitted bytes. 
The sequence number that is placed in the segment 
number field is the byte-stream number of the first 

10 byte in the segment. 

The acknowledge number placed in the 
acknowledgement number field, for example by host 110 
in an acknowledgement packet being transmitted to 
host 100, is the sequence number of the next byte of 

15 data that host 110 is expecting from host 100. Since 
according to TCP, host 110 only acknowledges bytes up 
to the first missing byte in the data stream, host 110 
may provide cumulative acknowledgements of a single 
segment when segments are received out of order, the 

2 0 cumulative acknowledgements for the single segment are 

referred to as duplicate acknowledgements. 

Each time host 100 sends a segment into a TCP 
connection, a timer is started. If the segment timer 
expires before host 100 receives an acknowledgement for 
25 the data in the segment from host 110, host 100 resends 
the . segment and initiates a slow start process. 
Typically, the timeout value for the timer is not much 
larger than a round-trip time between hosts 100 
and 110. The round-trip time is the time from when a 

3 0 segment is transmitted to when an acknowledgement for 

that segment is received. 

A segment can fail to reach host 110 due to data 
corruption or more usually as a result of network 
congestion. Most losses on the Internet are caused by 
35 congestion as routers run out of buffers and discard 
incoming traffic. 

TCP typically runs on top of the Internet 
Protocol (IP) . TCP must use end-to-end congestion 
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control because the IP layer provides no feedback to 
end systems concerning network congestion. Four TCP 
congestion control processes are described in Request 
for Comments (RFC) 2581. The four are slow start, 
5 congestion avoidance, fast retransmit, and fast 
recovery . 

Typically, for slow start, a slow start 
threshold ssthresh is maintained and for congestion 
avoidance, a congestion window cwnd is maintained for 
10 each connection. Congestion window cwnd is a sender 
side limit on the amount of data that the TCP sender 
can transmit into network 12 0 before receiving an 
acknowledgement . 


15 segment, TCP receiver 110 generates an immediate 

acknowledgement for the first missing segment, which is 
a duplicate acknowledgement. Since TCP sender 100 does 
not know whether the duplicate acknowledgement is 
caused by a lost segment, a reordering of the segments 

20 during transmission, or replication of an 

acknowledgement packet or a segment by network 120, TCP 
sender 100 waits to receive a small number of duplicate 
acknowledgements, typically three, before assuming a 
segment is lost. If a reordering of the segments is 

25 the problem, there are only one or two duplicate 
acknowledgements before the reordered segment is 
processed and a new acknowledgement generated. 

When the third duplicate acknowledgement is 
received, the TCP sender immediately retransmits the 

3 0 oldest unacknowledged segment without waiting for the 

segment transmission timer to expire. This is known in 
the art as a fast retransmit . 

Following the fast retransmit, fast recovery, but 
not slow start, is performed. Fast recovery governs 

35 the transmission of data by TCP sender 100 until a non- 
duplicate acknowledgement is received. 
. ^ When one or more segments are lost in network 12 0, 
network 120 is assumed congested by TCP sender 100. 


When a TCP receiver 110 receives an out-of-order 
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After recovery from the loss, TCP 1 s congestion 
avoidance mechanism arranges to continue transmission 
at half the rate, which was obtained previously. The 
objective of fast recovery is to maintain transmission 
5 (at the reduced rate) while the recovery is being 
performed . 

In fast recovery, the size of congestion 
window cwnd is reduced and slow start 
threshold ssthresh is set equal to the reduced size 

10 congestion window. As soon as is possible with the 
reduced size congestion window, another packet is 
transmitted over the network in response to each 
additional duplicate acknowledgement packet received 
from TCP receiver 110, and the size of congestion 

15 window cwnd is increased by the size of one segment. 
The principle behind this is that each duplicate 
acknowledgement implies the receiver has received a 
segment although the sender does not know which one. 
Since this means that a segment has left the network, 

2 0 the sender can insert one more segment without 
worsening the congestion. 

When a non-duplicate acknowledgement is received 
by TCP sender 100, congestion window cwnd is deflated 
by setting window equal to slow start 

25 threshold ssthresh, and the number of duplicate 

acknowledgements is set to zero. If only a single 
segment is lost in a round trip time, fast retransmit 
and fast recovery perform satisfactorily and data 
transmission continues at the reduced rate in an 

30 attempt to avoid congestion on network 120. 

However, fast recovery does not always result in a 
smooth recovery when there are multiple lost segments 
in a single round trip time. Since the number of 
duplicate acknowledgements is set to zero upon leaving 

35 fast recovery, TCP sender 100 must receive three new 
duplicate acknowledgements after termination of fast 
recovery before TCP sender 100 can determine that 
another drop has occurred and perform another fast 
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retransmit. If the reduced size congestion window cwnd 
is such that less than three new segments can be 
transmitted by TCP sender 100, TCP receiver 110 does 
not generate three duplicate acknowledgements and a 
5 retransmission timeout occurs. Similarly, if the new 
drop occurs almost a round trip time after the first 
drop, there may not be sufficient time available for 
generation of three duplicate acknowledgements, and 
again a retransmission timeout occurs. 

10 As network congestion increases , and multiple 

drops in a single round trip time occur, the 
performance of network 12 0 is degraded because fast 
recovery cannot handle multiple drops in a single round 
trip time. Hence, a new method is needed for 

15 recovering from such a situation. 

SUMMARY OF THE INVENTION 

According to the principles of this invention, a 
novel fast recovery extended method is used to enhance 
20 the performance of TCP fast recovery when multiple 

segment losses occur within a single round trip time 
between a TCP sender and a TCP receiver. Unlike the 
prior art fast recovery technique that worked well in 
only limited circumstances, the fast recovery extended 

2 5 method enhances recovery for second and subsequent 

segment losses within a single round trip time. 

In one embodiment, a transmission control protocol 
congestion avoidance method includes performing a TCP 
fast recovery process by a TCP sender, and performing a 

3 0 TCP fast recovery extended process by the TCP sender 

upon receiving acknowledgement of receipt of new data 
from a TCP receiver in the TCP fast recovery process. 
The fast recovery extended process determines, 
following receipt of the acknowledgement of receipt of 
3 5 new data, an excess number of duplicate 

acknowledgements based upon a count of consecutive 
duplicate acknowledgement packets. The fast recovery 
extended process takes a network packet transmission 
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recovery action based upon the excess number of 
duplicate acknowledgements, and then stores the excess 
number of duplicate acknowledgements as a number of 
duplicate acknowledgements for further use in 
5 congestion avoidance. 

The network packet transmission recovery actions 
include : 


Taking no further action; 
10 Deflating a size of a congestion window cwnd; 

Optimizing a size of the congestion window cwnd; 

Performing a second fast retransmit; 

Resizing the optimized size congestion 
wi ndow cwnd ; and 
15 Resizing the deflated size congestion window cwnd. 

Also, only the TCP sender needs to have the fast 
recovery extended method available. No changes are 
required of a TCP receiver. The retention and use of 

2 0 the knowledge provided by the number of duplicate 

acknowledgements results in a smoother and faster 
recovery from multiple losses in a single round trip 
time than was previously possible. Consequently, the 
fast recovery extended method of this invention makes 
25 more efficient use of the network, and improves 
throughput for the individual user. 

In another embodiment," a transmission control 
protocol method includes performing a TCP fast recovery 
process, and performing a TCP fast recovery extended 

3 0 process upon receiving acknowledgement of receipt of 

new data in the TCP fast recovery process. 

According to the principles of this invention, a 
network device includes a processor and a memory 
coupled to the processor. The memory stores a fast 
35 recovery extended method wherein upon execution of the 
fast recovery extended method by the processor a fast 
recovery process is extended. The memory is either a 
volatile memory, a non-volatile memory, or a 
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combination of the two. In addition, the network 
device includes a network interface. The fast recovery 
extended method stored in the memory includes 

determining, upon receiving acknowledgement 
5 of receipt of new data by the network device, an 

excess "number of duplicate acknowledgements based 
upon a count of consecutive duplicate 
acknowledgement packets ; 

taking a network packet transmission recovery 
10 action based upon the excess number of duplicate 

acknowledgements ; and 

storing the excess number of duplicate 
acknowledgements in the memory as a number of 
duplicate acknowledgements .- 
15 Another embodiment of the network device of this 

invention includes means for performing a TCP fast 
recovery process, and means for performing a TCP fast 
recovery extended process upon receiving 
acknowledgement of receipt of new data in the TCP fast 
20 recovery process. Yet another embodiment of the 
network device includes : 

means for determining, upon receiving 
acknowledgement of receipt of new data, an excess 
number of duplicate acknowledgements based upon a 

2 5 count of consecutive duplicate acknowledgement 

packets; 

means for taking a network packet 
transmission recovery action based upon the excess 
number of duplicate acknowledgements; and 

3 0 means for storing the excess number of 

duplicate acknowledgements as a number of 
duplicate acknowledgements . 
Another feature of this invention is a 
programmable memory including a fast recovery extended 
3 5 method wherein the fast recovery extended method upon 
execution includes : 

determining, upon receiving acknowledgement 
of receipt of new data, an excess number of 


-7- 



duplicate acknowledgements based upon a count of 
consecutive duplicate acknowledgement packets; 

taking a network packet transmission recovery- 
action based upon the excess number of duplicate 
5 acknowledgements; and 

storing the excess number of duplicate 
acknowledgements as a number of duplicate 
acknowledgements . 

10 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a prior art block diagram of two TCP 
hosts interconnected over a network. 

Figure 2A is a block diagram of a TCP sender and 
15 receiver interconnected over a network where the TCP 

sender includes the novel fast recovery extended method 
of this invention. 

Figure 2B is a high level process flow diagram of 
the fast recovery extended method of this invention. 
20 Figure 2C is a more detailed diagram of fast 

recovery extended state variables that are stored in a 
memory. 

Figures 3A and 3B are a detailed process flow 
diagram of one embodiment of the fast recovery extended 
25 method of this invention. 

Herein, elements with the same reference numeral 
are the same element. 

DETAILED DESCRIPTION 

30 

According to the principles of this invention, a 
novel fast recovery extended method 250 (Fig. 2A) is 
used to enhance the performance of TCP fast recovery 
when multiple segment losses occur within a single 
35 round trip time between a TCP sender 200, which is a 
network device, and a TCP receiver 210. Unlike the 
prior art fast recovery technique that worked well in 
only limited circumstances, fast recovery extended 
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method 2 50 enhances recovery for second and subsequent 
segment losses within a single round trip time. 

Fast recovery extended method 2 50 is used by TCP 
sender 200 upon receipt of an acknowledgement of 
5 receipt of new data by TCP receiver 210 that follows 
receipt of at least a predefined number of duplicate 
acknowledgement packets from TCP receiver 210. In 
particular, fast recovery extended method 250 utilizes 
information in fast recovery extended state data 251 

10 that is stored in a memory 202 to determine the best 

way to extend fast recovery. Hence, in contrast to the 
prior art fast recovery that set the number of 
duplicate acknowledgements to zero, deflated the 
congestion window, and terminated following receipt of 

15 an acknowledgement of receipt of new data by TCP 
receiver 210, fast recovery extended method 250 
utilizes the information available to recover more 
efficiently and more smoothly from the loss of data 
over network 2 90. 

2 0 Upon receipt of an acknowledgement packet for the 

segment retransmitted by a first fast transmit 
procedure, fast recovery extended method 250 determines 
whether further processing under fast recovery extended 
is beneficial. In particular, fast recovery extended 

25 method 250 analyzes the number of duplicate 

acknowledgements received immediately prior to receipt 
of the acknowledgement of .new data, and either 
terminates fast recovery, or extends fast recovery 
processing utilizing fast recovery extended state 

30 data 251. 

In particular, if fast recovery extended state 
data 251 indicates that further processing by 
method 2 50 would be advantageous because there appears 
to be a second loss within a single round trip time, 

3 5 method 2 50 adjusts the fast recovery extended state 

data and continues processing to compensate for the 
second loss. The network packet transmission recovery 
actions taken depend upon the values of the adjusted 
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recovery extended state data, as explained more 
completely, below, but in each instance, recovery is 
smoother than with prior art techniques, and better 
data throughput is maintained. Consequently, 
5 network 290 is used more efficiently, and an individual 
user sees enhanced performance of network 290. 

In one embodiment, fast recovery extended 
method 250 is stored in a memory 202 of a network 
device that is referred to as TCP sender 200. Fast 

10 recovery extended method 250 is executed by 

processor 201 at an appropriate time, as explained more 
completely below. An application 225, which also is 
stored in memory 202, executes on processor 201 and 
communicates using the network Transmission Control 

15 Protocol (TCP) over a network 290 with another 

application 221 that executes on a processor 211 in a 
device that is referred to as TCP receiver 210. 

The particular configurations of TCP sender 2 00 
and TCP receiver 210 are not essential to this 

2 0 invention. Either or both network devices can be, for 
example, a computer, a personal digital assistant, a 
network appliance, a set top box, a telephone, or any 
other combination of devices that can communicate over 
a network using TCP. In general, the network device 

2 5 includes a processor, a memory that is volatile, non- 
volatile, or a combination of the two, and a network 
interface. The memory is said to be programmable 
meaning that method 2 50 can be stored in the memory. 


30 stored in memory 202 of TCP sender 200. Moreover, 
while memory 202 is shown as a single unit in 
Figure 2A, those of skill in the art will appreciate 
that memory 2 02 may be a combination of various units 
and method 2 50 may be stored across such units as 

3 5 necessary. For example, method 2 50 may be stored in a 
non-volatile memory and executed directly from that 
memory, may be stored in a non-volative memory and then 
moved in whole or part into a volatile memory for 


Method 250, or at least a part of method 250, is 
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execution, or may be stored only in a volatile memory 
and executed from that memory. Similarly, the 
particular configuration of network 290 is not 
essential to this invention and can be, for example, a 
5 local area network, a wide area network, an intranet, 
the Internet, or any other network architecture or 
combination of network architectures that supports TCP. 

Also, only TCP sender 200 needs to have fast 
recovery extended method 250 available. This is in 

10 contrast to an alternative method, selective 
acknowledgement, which requires that both TCP 
sender 200 and TCP receiver 210 have selective 
acknowl edgement capabi 1 i ty . ( Sel ec t i ve acknowl edgement 
reduces redundant transmissions during recovery but 

15 does not necessarily avoid retransmission timeouts when 
multiple drops occur in one round trip time. This is 
because the procedures defining selective 

acknowledgement do not use selective acknowledgement to 
retrigger transmissions . ) 
2 0 According to the principles of this invention, TCP 

slow start and congestion avoidance initially may 
follow the prior art path. Consequently, fast recovery 
extended state data 251 (Figs. 2A and 2C) , sometimes 
called memory area 251, includes a slow start 

2 5 threshold ssthresh and a congestion window cwnd. 

Upon initiation of congestion avoidance by TCP 
sender 200, each duplicate acknowledgement received 
from TCP receiver 210 (Fig. 2A) is counted and at least 
one variable is stored in fast recovery extended state 

3 0 data 251 that is representative of a count of 

consecutive duplicate acknowledgement packets received 
by TCP sender 200. In one embodiment, the number of 
duplicate acknowledgements received is stored as a 
variable dup_ack. In another embodiment, the number of 
3 5 duplicate acknowledgements received is contained in the 
size of congestion window cwnd. In yet another 
embodiment, both variables are stored in memory 
area 251. 
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10 


15 


20 


25 


30 


When the number of duplicate acknowledgements is 
equal to duplicate acknowledgement 

threshold tcprexmtthresh, a first fast retransmit 260 
is performed, and congestion window cwnd is resized. 
In response to each subsequent duplicate 
acknowledgement in fast recovery 265, the size of 
congestion window cwnd is increased, and if permitted 
another segment is transmitted. The count of 
consecutive duplicate acknowledgements received is 
maintained in memory area 251 (Fig. 2B) . 

When an acknowledgement packet for the 
retransmitted segment is received from TCP 
receiver 210, fast recovery 265 (Fig. 2B) transfers to 
fast recovery extended method 250. 

Get excess operation 271 in method 250 determines 
an excess number of duplicate acknowledgements based 
upon the count of consecutive duplicate 
acknowledgements stored in memory area 251 and the 
number of segments acknowledged in the acknowledgement 
packet for the retransmitted segment . In this 
embodiment, the excess number of duplicate 
acknowledgements is stored in excess duplicate 
acknowledgements variable excess_dup_ack in memory 
area 251. Get excess operation 271 transfers to 
recovery action operation 272. 

In recovery action operation 272., TCP sender 200 
initiates one or more network packet, transmission 
recovery actions based upon the value of excess 
duplicate acknowledgements variable excess_dup_ack . 
The network packet transmission recovery actions 
include : 

Taking no further action; 

Deflating a size of congestion window cwnd; 
Optimizing a size of congestion window cwnd; 
Performing a second fast retransmit; 
Resizing the optimized size congestion 


window cwnd; and 
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Resizing the deflated size congestion window cwnd« 


The specific network packet transmission recovery- 
action or actions taken for a particular value of 
5 variable excess_dup_ack is described more completely 
below. Notice that unlike the prior art ' fast recovery 
method, fast recovery extended method 2 50 uses the 
knowledge concerning the excess number of duplicate 
acknowledgements from fast recovery 265 to take an 

10 appropriate action to expedite recovery for one or more 
segment losses in a single round trip time. 

Moreover, recovery action 2 72 transfers to set 
duplicate acknowledgment operation 273 that effectively 
sets variable. dup_ack equal to variable excess_dup_ack. 

15 Here, effectively means that if variable excess_dup_ack 
is being measured in bytes rather than as an absolute 
number, variable excess_dup_ack is converted to an 
absolute number. 

Thus, the knowledge concerning the excess number 

20 of duplicate acknowledgements is retained for the next 
application of congestion avoidance. For example, if 
the excess number of duplicate acknowledgements is two, 
the next duplicate acknowledgement results in a fast 
retransmission, because another segment has been lost 

25 in the network. Typically, in a similar situation, the 
prior art fast recovery set the number of duplicate 
acknowledgements to zero upon* completing, and so three 
new duplicate acknowledgements would be required before 
a segment would be retransmitted. Consequently, a 

3 0 timeout may have occurred in the prior art method, as 
described above , that is avoided in this example. 

Figures 3A and 3B are a more detailed process flow 
diagram of one embodiment of this invention. When a 
TCP connection is established between TCP sender 200 

3 5 and TCP receiver 210 over network 2 90, data flow is 
initiated across network 2 90 using TCP slow start 
method 301, and so a slow start threshold ssthresh is 
initialized and stored in fast recovery extended state 
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data 251 in memory 202. Similarly, congestion 
window cwnd is initialized and stored in fast recovery 
extended state data 251 in memory 202. Another 
parameter that is initialized is a duplicate 
5 acknowledgement threshold tcprexmt thresh, which is the 
number of consecutive duplicate acknowledgements that 
must be received by TCP sender 200 prior to initiating 
fast retransmit process 260, i.e., a predefined number 
of duplicate acknowledgements. Typically, duplicate 
10 acknowledgement threshold tcprexmtthresh is set to 
three . 

In addition, all normal TCP parameters have been 
initialized including TCP sender's maximum segment size 
maxseg, and TCP receiver's advertised window tiwin. 

15 The initialization of TCP is known to those of skill in 
the art and so is not considered further. See for 
example, Gary R. Wright and W. Richard Stevens, TCP/IP 
Illustrated, Volume 2 , Addison Wesley, Reading, 
Massachusetts, (1995), which is incorporated herein by 

20 reference as an example of the level of skill in the 
art . 

In slow start method 3 01, congestion window cwnd 
is increased each time receipt of new data is 
acknowledged by TCP receiver 210. So long as 
25 congestion window cwnd is less than or equal to slow 
start threshold ssthresh, TCP sender 2 00 remains in 
slow start method 301. Otherwise, TCP sender 200 exits 
slow start method 301 and enters congestion avoidance 
method 3 02 . 

30 In congestion avoidance method 302, TCP sender 200 

monitors the number of duplicate acknowledgements 
received from TCP receiver 210. In particular 
duplicate acknowledge check operation 3 03 determines 
whether an acknowledgement packet received from TCP 

35 receiver 210 is a duplicate acknowledgement. Testing 
for a duplicate acknowledgement is well known in the 
art and so is not considered further herein. See for 
example, Wright and Stevens that is cited above. 
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If the acknowledgement packet is not a duplicate , 
operation 303 transfers to continue 304 and TCP 
sender 200 and receiver 210 communicate according to 
TCP. Conversely, if the acknowledgement packet is a 
5 duplicate, duplicate acknowledgement check 

operation 3 03 transfers to equals threshold check 
operation 305. 

Equals threshold check operation 3 05 compares the 
number of duplicate acknowledgements received with 
10 duplicate acknowledgement threshold tcprexmt thresh. If 
the number of received duplicate acknowledgements is 
less than duplicate acknowledgement threshold 
tcprexmt thresh, operation 305 returns to duplicate 
acknowledgement check operation 303,. and otherwise 
15 transfers to a first fast retransmit operation 260. 
Initially, for this example, assume that the size of 
congestion window cwnd is C segments. 

Also assume that TCP sender 200 has transmitted 
C segments to TCP receiver 210 and cannot send any more 

2 0 segments until an acknowledgement of receipt of new 

data by TCP receiver 210 is received. Instead of an 
acknowledgement of receipt of new data, TCP sender 2 00 
receives three duplicate acknowledgement packets from 
TCP receiver 210. 
25 Upon receipt of the third duplicate 

acknowledgement, TCP sender 2 00 transfers from check 
operation 305 to first fast retransmit process 260. 
TCP sender 200 interprets the three duplicate 
acknowledgements as indicating that a segment has been 

3 0 lost over network 2 90, because, in this example and 

typi cal ly , dupl icat e acknowledgement 
threshold tcprexmt thresh is three. 

In fast retransmit process 260, TCP sender 200 
first sets the value of slow start threshold ssthresh 
35 to one-half the current value of congestion 

window cwnd, with a minimum value of two segments, 
i.e., the value of the current congestion window cwnd 
is saved. A retransmission timer is turned off, and a 
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round trip time is set to zero in case a segment is 
being timed. The pointer to the next segment to 
transmit by TCP sender 200 is set to the byte value 
received in the latest duplicate acknowledgement 
5 packet, and congestion window cwnd is set to the 
maximum segment size for TCP sender 200. 

TCP sender 200 retransmits the missing segment, 
and sets congestion window cwnd to threshold ssthresh 
plus the number of segments TCP receiver 210 has cached 

10 as indicated by the number of duplicate 

acknowledgements received, which is three. The next- 
to-send pointer is moved forward again to address the 
first new segment, i.e., the first segment that has 
never been transmitted. This completes first fast 

15 transmit process 260, and so processing transfers to 
duplicate acknowledgement check operation 307 in fast 
recovery method 265. 

As each additional duplicate acknowledgement is 
received from TCP receiver 210, TCP sender 2 00 

20 transfers processing from duplicate acknowledgement 

check operation 307 to adjust window operation 308. In 
adjust window operation 308, the size of congestion 
window cwnd is increased by the size of one segment. 
Accordingly, the difference between threshold ssthresh 

25 and the size of congestion window cwnd represents the 
number of duplicate acknowledgements received by TCP 
sender 2 00. 

Herein, it is assumed that a segment and the 
maximum transmission unit are the same size. However, 

3 0 if the maximum transmission unit is different from a 
segment, method 250 is defined in terms of maximum 
transmissions units. In general, TCP always sends data 
as maximum transmission units unless the size of the 
data waiting to be sent is less than the size of a 

35 maximum transmission unit. 

Operation 308 transfers to transmit check 
operation 309. If congestion window cwnd is at least 
one segment larger than the number of unacknowledged 
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packets, operation 309 transfers to transmit 
operation 310 and otherwise to duplicate 
acknowledgement check operation 307. The reduced size 
of congestion window cwnd prevents TCP sender from 
5 immediately transmitting additional segments, and 
processing cycles through operations 307 to 309. 

After (C/2 -2) duplicate acknowledgements, adjust 
window operation 308 increases congestion window cwnd 
to a size of (C+l) segments, and so transmit check 

10 operation transfers to transmit operation 310 in which 
TCP sender 200 transmits another segment. Hence, the 
number of unacknowledged segments is (C+l) segments. 
For each additional duplicate acknowledgement received 
by TCP sender :200, processing goes, from check operation 

15 to adjust window operation 3 08 in which congestion 
window cwnd is incremented. Consequently, another 
segment is transmitted in operation 310. This sequence 
of operations 307 to 310 continues until (C-D) 
duplicate acknowledgements are received by TCP 

20 sender 200 at which time ( (3C/2) -D) unacknowledged 
segments have been transmitted, and congestion 
window cwnd has a size ( (3C/2) -D) segments, where D is 
the number of segments dropped in one round trip time. 

At this point, TCP receiver 210 has sent duplicate 

2 5 acknowledgements for all the segments sent prior to the 
receipt of the retransmitted segment. When TCP 
receiver 210 receives the retransmitted segment, TCP 
receiver 210 not only acknowledges the retransmitted 
segment, but also each additional consecutive segment 

30 in the buffer of TCP receiver 210, which in this 

example is X additional segments. Consequently, the 
acknowledgement packet from TCP receiver 210 that 
acknowledges new data acknowledges the retransmitted 
segment and co- acknowledges X additional segments. 

35 In the prior art at this point, duplicate 

acknowledgement check operation 3 07 would transfer to 
an operation in which TCP sender 200 set the number of 
duplicate acknowledgements to zero, and set congestion 
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window cwnd equal to slow start threshold ssthresh. 
Consequently, all information concerning the number of 
duplicate acknowledgements that was stored in 
congestion window cwnd was lost in the prior art fast 
5 recovery method. 

However, as indicated above, fast recovery 
extended method 250 of this invention, does not 
automatically terminate upon receipt of an 
acknowledgement packet for the retransmitted segment . 

10 Prior to considering the specific operations performed 
when duplicate acknowledge operation 307 detects other 
than a duplicate acknowledgement, consider that at this 
point in time, TCP sender 200 cannot be sure of the 
relationship between the number of duplicate 

15 acknowledgements received, and the number X of co- 
acknowledged segments. While it is true that TCP 
receiver 210 generated a duplicate acknowledgement for 
each of the X co-acknowledged segments, some of the X 
duplicate acknowledgements may have been lost in 

2 0 network 2 90 (Fig. 2A) . Conversely, some of the 
duplicate acknowledgements, may be for segments 
following the X co-acknowledged segments. 

TCP sender 2 00 does know, however, that TCP 
receiver 210 must have received the larger of the 

2 5 number of duplicate acknowledgements and the number of 

co -acknowledged packets. Hence, according to the 
principles of this invention, TCP sender 200 uses this 
knowledge to determine the action to be taken in 
response to the acknowledgement packet that 

3 0 acknowledged the retransmitted segment and X additional 

segments . 

Specifically, in segments acknowledged 
operation 321 in fast recovery extended method 250, TCP 
sender 200 determines the number of co- acknowledged 
35 segments, e.g., X co- acknowledged segments in this 
example. Operation 321 transfers to initialization 
operation 322 . 
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Initialization operation 322 sets a next duplicate 
acknowledgements variable to zero. The next duplicate 
acknowledgements variable is a key to the fast recovery 
extended because this variable retains the information 
5 about the excess number of duplicate acknowledgements 
that is -stored in congestion window cwnd, as explained 
more completely below. Initialization operation 322 
transfers processing to inflated window check 
operation 323 . 

10 Inflated window check operation 323 determines 

whether congestion window cwnd was increased in size 
after the three duplicate acknowledgments were 
received. If congestion window cwnd was not inflated, 
processing transfers to set duplicate acknowledgements. 

15 operation 273 and otherwise to get excess 
operation 271. 

In get excess operation 271, TCP sender 200 first 
determines the excess number of duplicate 
acknowledgements. Specifically, operation 324 

2 0 determines the inflation in congestion window cwnd and 

then subtracts the total number of acknowledged 
segments to determine the excess number of duplicate 
acknowledgements. Operation 271 transfers to excess 
check operation 325. 

25 In particular, if the excess number of duplicate 

acknowledgements is greater than or equal to one, 
excess check operation 325 transfers' processing to 
optimize window operation 326, and otherwise to deflate 
window operation 327. If excess check operation 325 

30 transfers to deflate window operation 327, the co- 
acknowledgement completely accounts for the duplicate 
acknowledgements previously received, and so the count 
of duplicate acknowledgements conveys no additional 
information. Thus, extended recovery operations are 

3 5 not warranted, and deflate window operation sets 

congestion window cwnd equal to slow start 

threshold ssthresh, and transfers to window okay check 
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operation 330, which is described more completely 
below . 

However, if the number of duplicate 
acknowledgements is greater than the number X of co- 
5 acknowledged segments, TCP sender 2 00 knows that TCP 
receiver 210 has received segments which cannot yet be 
explicitly acknowledged. This may be because a second 
segment was lost in network 2 90, or some segments were 
received out of order. Consequently, unlike the prior 

10 art that simply dropped the information concerning the 
excess duplicate acknowledgements, excess check 
operation 325 transfers to optimize window 
operation 326, as indicated above. 

As in the prior art fast recovery method, the 

15 intent of fast recovery extended method 250 is to keep 
the number of packets in flight constant until recovery 
is complete. The excess number of duplicate 
acknowledgements is grounds for considering 
retransmitting another segment. However, congestion 

2 0 window cwnd should be reduced since network 2 90 is 

still experiencing congestion as indicated by the loss 
of another segment. Preferably, the reduction in 
congestion window cwnd does not result in a congestion 
window less than the quantity of unacknowledged data 

2 5 when recovery is complete, and does not result in 

transmission of multiple segments at wire speed. With 
these criteria, congestion window, in bytes, is defined 
in optimize window operation 326 as: 
New congestion window = 

3 0 old congestion window - (number of co- 

acknowledged segments + 1) - (0.5) (TCP sender 
maximum segment size) . 
Operation 326 transfers to excess greater than or 
equal, to threshold check operation 327. Operation 327 
35 compares the excess number of duplicate 

acknowledgements with duplicate acknowledgement 
threshold tcprexmt thresh. If the excess number of 
received duplicate acknowledgements is less than 
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duplicate acknowledgement threshold tcprexmt thresh, 
operation 32 6 transfers to window okay check 
operation 330, and otherwise transfers to fast transmit 
operation 339. 


is greater than threshold tcprexmtthresh, another 
segment has been lost and so immediate re- transmission 
of the oldest unacknowledged segment is justified. 
Consequently, fast retransmit operation 32 9, which 

10 performs the same operations as fast retransmit 

operation 260, immediately retransmits the oldest 
unacknowledged segment. Notice that the second 
transmission effectively happens immediately after the 
receipt of the acknowledgement of the first 

15 retransmitted segment, and does not have to wait for 

receipt of three additional duplicate acknowledgements 
after fast recovery is terminated as in the prior art. 
Fast retransmit operation 260 transfers to window okay 
check operation 330. 

20 In window okay check operation 330, if congestion 

window cwnd is greater than the number of outstanding 
unacknowledged segments plus three segments, 
operation 330 transfers to resize operation 331 and 
otherwise to set dup acks operation 273. In resize 

25 operation 331 congestion window cwnd is resized so that 
a maximum of two segments can.be. transmitted in a' row, 
e.g., congestion window cwnd. is set to the number of 
unacknowledged segments plus three segments minus one. 
Operation 331 transfers to set dup acks operation .273 . 

30 In set dup acks operation 273, the excess number 

of duplicate acknowledgements is converted to an 
absolute number, if necessary, and stored as the next 
duplicate acknowledgements variable. Next duplicate 
acknowledgements variable is set equal to the number of 

35 duplicate acknowledgements dup_ack, in contrast to the 
prior art that set the number of duplicate 
acknowledgement to zero on leaving the prior art fast 
recovery method. The retention of the number of 


5 


If the excess number of duplicate acknowledgements 
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duplicate acknowledgements permits the next pass 
through congestion avoidance in response to the next 
duplicate acknowledgement to respond more quickly than 
in the prior art . 


acknowledgements is two, and duplicate acknowledgement 
check 303 detects another duplicate acknowledgement, 
threshold check operation 305 transfers to fast 
retransmit operation 260. Hence, it is no longer 

10 necessary upon leaving method 250 that congestion 
window cwnd permit transmission of three segments 
before another segment can be retransmitted. 
Accordingly, a retransmission timeout that would have 
occurred in the prior art is no longer a problem. 

15 Using the knowledge provided by the excess number of 
duplicate acknowledgements allows a more smooth 
recovery than was previously possible, and data 
throughput is maintained. Consequently, the fast 
recovery extended method of this invention makes more 

2 0 efficient use of the network and improves throughput 
for the individual user. 


5 


For example, if the extra number of duplicate 
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SECTION I 


An embodiment of the fast recovery extended method 
5 of this invention follows in the C programming 

language. This embodiment is illustrative only and is 
not intended to limit the invention to the particular 
computer code presented herein. 

10 if (SEQ_LEQ ( ti - >t i_ack, tp- >snd_una) ) { 

if (ti->ti_len == 0 && tiwin ~ tp->snd_wnd) { 
tcpstat . tcps_rcvdupack++ ; 
if (tp->t_timer [TCPT__REXMT] == 0 | | 
ti->ti_ack != tp->snd_una) 
15 tp- >t_dupacks = 0: 

else if ( ++tp- >t_dupacks == 

tcprexmtthresh) { 
/* tcprexmtthresh is set to 3 in one embodiment*/ 

tcp_seg onxt = tp->snd_nxt ; 
2 0 unsigned int win = 

min ( tp- >snd_wnd, 
tp- >snd_cwnd) / 2 / 
tp - >t__maxseg ; 
if (win < 2) 

2 5 win = 2; 

tp- >snd_ssthresh = win * 
tp- >t_maxseg; 
tp->t_timer [TCPT_REXMT] = 0; 
tp->t_rtt = 0; 

3 0 tp->snd_nxt = ti->ti_ack; 

t p - > s nd_cwnd = tp- >t_maxseg ; 
(void) tcp_output (tp) ; 
tp- >snd_cwnd=tp- >snd_ssthresh+ 
tp->t_maxseg*tp->t_dupacks ; 
35 if (SEQ_GT (onxt , tp- >snd_nxt ) ) 

t p - > snd_nxt = onxt ; 
goto drop; 
} else if (tp- >t_dupacks > 
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10 


tcprexmtthresh) { 
tp->snd_cwnd += tp- >t_maxseg ; 
(void) tcp_output ( tp) ; 
goto drop; 

} 

} else 

tp- >t_dupacks = 0; 
break; 


} 


/* compute amount of data acked by this ack*/ 
acked = ti->ti_ack - tp->snd_una; 
/* default value to assign to t_dupacks */ 
unsigned int next_dup_acks = 0 ; 
15 if (tp- >t_dupacks > tcprexmtthresh && 

tp- >snd_cwnd > tp- >snd_ssthresh) { 
/* progressive ack after dup acks and retransmission*/ 
/* hold off further increasing cwnd for now*/ 
incr = 0; 

2 0 int excess_dup_ack = 

tp->snd_cwnd - tp- >snd_ssthresh - acked 
if (excess_dup__ack< (int) tp- >t_maxseg) { 

/* Normal case: no excess dupack count worth using*/ 
tp- >snd_cwnd = tp- >snd_ssthresh; } 
25 else { 

/* cwnd is reduced to. match, the reduction in the 

// quantity of unacked data. Thus, congestion window 

// is chosen as 

// cwnd -= acked - maxseg/2; 

3 0 // combined with ensuring tp->snd_cwnd is not too 

// high at the end, i.e., making 

/ / cwnd < = snd_nxt - snd_una 

+ tcprexmtthresh*maxseg - 1 

// */ 

35 tp->snd_cwnd -= acked - tp- >t_maxseg/2 ; 

next_dup_acks = exc e s s_dup_a c k / 

tp - >t_maxseg ; 
if (next_dup_acks >= tcprexmtthresh) { 
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/* There must have been a second drop, 

// Retransmit the oldest unacknowledged packet 

*/ 

tcp_seq onxt = tp- >snd_nxtp ; 

5 /* Save congestion avoidance state*/ 

unsigned int save_cwnd = tp- >snd_cwnd; 

unsigned int win =min ( tP- >snd_wnd, 

tp- >snd_cwnd) / 2 / 

tp- >t_maxseg ; 

10 if (win < 2) 

win = 2 ; 

tp- >snd_ssthresh = win * 

tp- >t_maxseg ; 

tp- >snd_cwnd = tp- >t_maxseg ; 

15 tp->snd_nxt = ti->ti_ach; 

(void) tcp_output ( tp) ; 

if (SEQ_GT (onxt , tp- >snd_nxt ) ) 
tp - >snd_nxt = onxt ; 

/* Restore congest ion window*/ 
2 0 tp- >snd_cwnd = save_cwnd; 

} 

} 

if (tp- >snd_cwnd>= tp->snd_nxt - tp->snd_una + 

t cprexmt thresh* tp- 

2 5 >t_maxseg) { 

tp->snd_cwnd = tp->snd_nxt - tp- >snd_una 
tcprexmtthresh*tp- >t_maxseg - 1 ; 

} 

} 

3 0 tp- >t_dupacks = next_dup_acks; 

if ( SEQ_GT ( t i - > t i_ack , tp- >snd_max) ) { 
tcpstat . tcps_rcvacktoomuch++ ; 
goto dropaf terack ; 

} 

3 5 tcpstat . tcps_rcvackpack++ ; 

tcpstat . tcps__rcvackbyte += acked; 
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